Read access and storage circuitry read allocation applicable to a cache

ABSTRACT

A read allocation indicator (e.g. read allocation signal  30 ) is provided to storage circuitry (e.g. cache  22 ) to selectively determine whether read allocation will be performed for the read access. Read allocation may include 
         modification of the information content of the cache ( 22 ) and/or modification of the read replacement algorithm state implemented by the read allocation circuitry ( 70 ) in cache ( 22 ). For certain types of debug operations, it may be very useful to provide a read allocation indicator that ensures that no unwanted modification are made to the storage circuitry during a read access. Yet other types of debug operations may want the storage circuitry to be modified in the standard manner when a read access occurs.

FIELD OF THE INVENTION

The present invention relates in general to storage circuitry read allocation, and more particularly to storage circuitry read allocation applicable to a cache.

RELATED ART

In prior art data processing systems, a read access which misses in a-cache will cause a cache line in the cache to be allocated according to the cache's read replacement algorithm This is the behavior that is typically desired during normal operation of the cache. However, this behavior can be problematic under certain circumstances.

BRIEF DESCRIPTION OF THE DRAWINGS

The present invention is illustrated by way of example and not limited by the accompanying figures, in which like references indicate similar elements, and in which:

FIG. 1 illustrates, in block diagram form, a data processing system in accordance with one embodiment of the present invention;

FIG. 2 illustrates, in flow diagram form, a read access to a cache in accordance with one embodiment of the present invention; and

FIG. 3 illustrates, in flow diagram form, a debug operation which requires a read access to storage circuitry in accordance with one embodiment of the present invention.

Skilled artisans appreciate that elements in the figures are illustrated for simplicity and clarity and have not necessarily been drawn to scale. For example, the dimensions of some of the elements in the figures may be exaggerated relative to other elements to help improve the understanding of the embodiments of the present invention.

DETAILED DESCRIPTION

In prior art data processing systems, a read access by a debugger which misses in an L2 cache will cause a cache line in the L2 cache to be allocated, potentially replacing a currently valid and potentially modified cache line. This may be the desired result if the debugger is single-stepping through a series of instructions to be executed. However, this may cause a problem if the debugger is reading the contents of the cache merely for visibility purposes. The present invention addresses this problem, as well as a number of other problems relating to read access and storage circuitry read allocation.

FIG. 1 illustrates a data processing system 10 in accordance with one embodiment of the present invention. In one embodiment, data processing system 10 includes debug circuitry 12, which is bi-directionally coupled to switching circuitry 20 by way of signals 36 and which provides read allocation signal 30 to switching circuitry 20. In one embodiment, data processing system 10 also includes processor 14, which is bi-directionally coupled to switching circuitry 20 by way of signals 38 and which provides read allocation signal 31 to switching circuitry 20. In one embodiment, data processing system 10 also includes direct memory access (DMA) 16, which is bi-directionally coupled to switching circuitry 20 by way of signals 40 and which provides read allocation signal 32 to switching circuitry 20. In one embodiment, data processing system 10 also includes bus master 18, which is bi-directionally coupled to switching circuitry 20 by way of signals 42 and which provides read allocation signal 33 to switching circuitry 20. In one embodiment, data processing system 10 also includes L2 cache 22, which is bi-directionally coupled to switching circuitry 20 by way of signals 44 and which receives read allocation signal 34 from switching circuitry 20. In one embodiment, data processing system 10 also includes peripheral 24, which is bi-directionally coupled to switching circuitry 20 by way of signals 46 and which receives read allocation signal 35 from switching circuitry 20.

Note that in one embodiment of the present invention, one of read allocation signals 30-33 is provided by switching circuitry 20 as read allocation signal 34, depending upon which one of 12, 14, 16, or 18 initiates a read access to L2 cache 22. Note that in one embodiment of the present invention, one of read allocation signals 30-33 is provided by switching circuitry 20 as read allocation signal 35, depending upon which one of 12, 14, 16, or 18 initiates a read access to peripheral 24.

In one embodiment of the present invention, processor 14 includes an L1 cache 15 that is coupled to processor 14. Although the L1 cache 15 has been shown as being more directly coupled to processor 14 than the L2 cache 22, alternate embodiments of the present invention may have any number of caches that are coupled to processor 14 in any desired manner. In alternate embodiments of the present invention, the L1 cache 15 and the L2 cache 22 need not be caches, but may be any type of storage circuitry that has read allocation capability. In one embodiment of the present invention, L2 cache 22 is bi-directionally coupled to system memory 26 by way of conductors 48.

In one embodiment of the present invention, read allocations signal 30 and signals 36 are portions of a bus 60; read allocations signal 31 and signals 38 are portions of a bus 61; read allocations signal 32 and signals 40 are portions of a bus 62; read allocations signal 33 and signals 42 are portions of a bus 63; read allocations signal 34 and signals 44 are portions of a bus 64; and read allocations signal 35 and signals 46 are portions of a bus 65. Buses 60-65 may be S the same bus, or may be one or more subsets of a common bus having a common bus protocol. Alternately, different ones of buses 60-65 may use the same bus protocol or different bus protocols. In alternate embodiments of the present invention, interconnect circuitry 50 may not use any bus structure or bus protocol. In one embodiment of the present invention, interconnect circuitry 50 includes switching circuitry 20 and buses 60-65; however, interconnect circuitry 50 can be implemented in any manner. As used herein, the term “bus” is used to refer to a plurality of signals or conductors which may be used to transfer one or more various types of information, such as data, addresses, control, or status.

In one embodiment of the present invention, L2 cache 22 includes read allocation circuitry 70 and write allocation circuitry 74 which are each bi-directionally coupled to storage element 72. Storage element 72 may be used to store information, such as data or instructions that have been cached, as well as history bits and cache tags for some embodiments of the present invention. In alternate embodiments of the present invention, storage element 72 may be implemented in any desired manner and may store any desired information.

In the embodiment of the present invention illustrated in FIG. 1, debug circuitry 12, processor 14, direct memory access (DMA) 16, and bus master 18 may each be an access initiator which initiates a read access to L2 cache 22. Alternate embodiments of the present invention may use different configurations. For example, alternate embodiments of the present invention may have fewer, more, or different blocks of circuitry than 12, 14, 16, and 18 which initiate read accesses to L2 cache 22. Bus master 18 may be any type of circuitry that is capable of initiating a read access to the L2 cache 22 across buses 63 and 64. If interconnect circuitry 50 does not use a bus having a bus protocol, circuitry 18 may not be a bus master, but instead may be any type of circuitry that is capable of initiating a read access to the L2 cache 22.

FIG. 2 illustrates, in flow diagram form, a read access to an L2 cache 22 (see FIG. 1) in accordance with one embodiment of the present invention. The flow starts at start oval 100. The flow then continues to step 101 where a read access is initiated. The flow continues to step 102 where a read allocation signal corresponding to the read access is provided. Referring to FIG. 1, any one of circuitry 12, 14, 16, or 18 may initiate a read access and will subsequently provide the read allocation signal. The flow then continues to decision diamond 103 where the question is asked “is the read access a cache-inhibited read access?”. If the answer to decision diamond 103 is yes, the flow continues to step 104 where the cache-inhibited read access is completed. From step 104, the flow finishes at end oval 110. If the answer to decision diamond 103 is no, the flow continues to decision diamond 105 where the question is asked “does the read allocation signal indicate that no read allocation is to be performed?”. If the answer to decision diamond 105 is no, the flow continues to step 106 where the cacheable read access is completed while allowing the contents of the cache and the state of the read: allocation circuitry to be modified. From step 106, the flow finishes at end oval 110. If the answer to decision diamond 105 is yes, the flow continues to decision diamond 107 where the question is asked “does the read access result in a cache miss?”. If the answer to decision diamond 107 is no, the flow continues to step 108 where the cacheable read access is completed without modifying the state of the read allocation circuitry. If the answer to decision diamond 107 is yes, the flow continues to step 109 where the cacheable read access is completed without modifying the contents of the cache and without modifying the state of the read allocation circuitry. In alternate embodiments of the present invention, decision diamond 103 is optional. If decision diamond 103 is not used, the flow continues from step 102 to decision diamond 105.

FIG. 3 illustrates, in flow diagram form, a debug operation which requires a read access to storage circuitry in accordance with one embodiment of the present invention. In one embodiment of the present invention, debug circuitry 12 of FIG. 1 may be used to initiate and perform the debug operation. The flow starts at start oval 200. The flow then continues to step 201 where a debug operation which requires a read access is initiated. The flow continues to decision diamond 202 where the question is asked “is the debug operation a single-step operation?”. If the answer to decision diamond 202 is yes, the flow continues to step 203 where a read allocation signal corresponding to the read access is provided to indicate that read allocation is to be performed. From step 203, the flow continues to step 204 where the read access with read allocation is performed. From step 204, the flow continues to step 205 where the debug operation is completed. From step 205, the flow finishes at end oval 206. If the answer to decision diamond 202 is no, the flow continues to step 207 where a read allocation signal corresponding to the read access is provided to indicate that no read allocation is to be performed. From step 207, the flow continues to step 208 where the read access without read allocation is performed. From step 208, the flow continues to step 205 where the debug operation is completed. From step 205, the flow finishes at end oval 206.

There are certain applications, such as, for example, certain types of debug operations, where it is advantageous for read accesses performed to system memory 26 to be unobtrusive to one or more of the caches (e.g. L2 cache 22). This requires that no modification of cache state and/or contents should occur when a read access is performed. This is important, since debugging of the data processing system may cause other undesired side-effects if cache state and/or contents are modified. In addition, side-effects may cause the problem being debugged to change character, or to not be manifested, which would prevent debugging from being successful. On the other hand, there are certain applications, such as, for example, other debug operations (e.g. single-stepping through user application instructions) where it is advantageous for read accesses performed to system memory 26 to cause the same modifications to be made to one or more of the caches (e.g. L2 cache 22) as would occur during normal operation. It is thus very useful to be able to selectively perform read allocation in a cache or other storage circuitry based upon the debug operation type.

In one embodiment of the present invention, a read allocation signal (e.g. 30, 31, 32, or 33) is used as a read allocation indicator to indicate whether or not read allocation is to be performed (e.g. in cache 22). If read allocation does not occur, then a read access is performed without modifying the information content of cache 22 and/or without modifying the read allocation state of cache 22. Some embodiments of the present invention prevent modification of the information content of cache 22, some embodiment of the present invention prevent modification of the read allocation state of cache 22, and some embodiments of the present invention prevent both modification of the information content of cache 22 and the read allocation state of cache 22. The content of cache 22 is the information that is stored in storage element 72 (see FIG. 1). The read allocation circuitry 70 is used to implement the read replacement algorithm used in cache 22. A read replacement algorithm is the rule or set of rules that are used to determine which cache line will be replaced next when the cache misses on a read access. The present invention may be used with any desired read replacement algorithm. Thus, the read replacement algorithm implemented by read allocation circuitry 70 may be any desired replacement algorithm. The replacement algorithm steps through a series of states as each cache miss occurs in order to always have an updated value for which cache line should be filled for the next subsequent cache miss on a read access.

For some embodiments of the present invention (e.g. embodiments using one or more selected types of debug operations), it is not enough to prevent modification of the information content of the cache; the current state of the read replacement algorithm (e.g. the read allocation state) of read allocation circuitry 70 must also not be affected. Note that some embodiments of the present invention may optionally have write allocation circuitry 74 which determines the behavior of cache 22 for write accesses. Alternate embodiments of the present invention may not even have write allocation circuitry 74. In some embodiments of the present invention, the operation and replacement algorithm of the read allocation circuitry 70 is totally independent of the operation and replacement algorithm of the write allocation circuitry 74. Some embodiments of the present invention may have common replacement algorithms for both read and write cache misses. In such embodiments, read allocation circuitry 70 and write allocation circuitry 74 may be combined into a single and/or common circuitry, although it is important to note that read allocation and write allocation present entirely different problems to a user or debugger of data processing system 10.

Although the present invention has been described in the context of specific embodiments that use a cache 22, alternate embodiments of the present invention may use any type of storage circuitry that has read allocation capability. The present invention is not limited to cache applications. Also, although the present invention is useful for debug operation, the present invention may be used for any desired purpose and is not limited to use in debug operations. As one possible example of a different use, processor 14 may selectively determine that read allocation should not be performed in one or more caches because the data in the read access is not likely to be reused, thus it may be advantageous to prevent the read access from causing replacement of potentially more useful information already contained in a cache.

Note that the present invention may also be used with the L1 cache 15. For this embodiment, a read allocation signal (not shown) may be provided from processor 14 to the L1 cache 15. This read allocation signal (not shown) may function in a same or similar manner to read allocations signals 30-35. Also, the present invention may be used with peripheral 24. The read allocation signal 35 may function in a same or similar manner to read allocation signal 34 provided to L2 cache 22.

In the foregoing specification, the invention has been described with reference to specific embodiments. However, one of ordinary skill in the art appreciates that various modifications and changes can be made without departing from the scope of the present invention as set forth in the claims below. Accordingly, the specification and figures are to be regarded in an illustrative rather than a restrictive sense, and all such modifications are intended to be included within the scope of present invention.

Benefits, other advantages, and solutions to problems have been described above with regard to specific embodiments. However, the benefits, advantages, solutions to problems, and any element(s) that may cause any benefit, advantage, or solution to occur or become more pronounced are not to be construed as a critical, required, or essential feature or element of any or all the claims. As used herein, the terms “comprises,” “comprising,” or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. 

1-26. (canceled)
 27. A method for performing a read access, comprising: initiating a debug operation requiring a read access to a cache and having a debug operation type; and selectively performing read allocation of the cache during the read access, based at least on the debug operation type.
 28. The method of claim 27, wherein read allocation comprises at least one of modifying contents of the cache and modifying a read allocation state of the cache.
 29. The method of claim 28, wherein when the debug operation type is a first type, read allocation is performed during the read access and when the debug operation type is a second type, read allocation is not performed during the read access.
 30. The method of claim 28, wherein when the debug operation type is a first type, a read allocation indicator corresponding to the read access is provided to the cache to indicate that read allocation is to be performed for the read access, and when the debug operation is a second type, a read allocation indicator corresponding to the read access is provided to the cache to indicate that no read allocation is to be performed for the read access.
 31. An apparatus, comprising: a cache; interconnect circuitry coupled to the cache; and debug circuitry, coupled to the interconnect circuitry, the debug circuitry initiating a debug operation requiring a read access to the cache and providing a read allocation indicator to the cache corresponding to the read access via the interconnect circuitry.
 32. The apparatus of claim 31, wherein the cache, in response to the read allocation indicator, selectively performs read allocation during the read access.
 33. The apparatus of claim 32, wherein the cache, in response to the read allocation indicator and based on the debug operation, selectively performs read allocation during the read access.
 34. The apparatus of claim 31, wherein when the read allocation indicator has a first value, the read access is performed with read allocation and when the read allocation indicator has a second value, the read access is performed without read allocation.
 35. The apparatus of claim 31, wherein the interconnect circuitry comprises switching circuitry to route the read allocation indicator from the debug circuitry to the cache.
 36. The apparatus of claim 31, wherein the interconnect circuitry comprises a system bus coupled to the debug circuitry and to the cache, and wherein the read allocation indicator is provided via the system bus. 